Skip to main content

Web Collection 2012

Web crawl data from the year 2012. Some of this data is currently not publicly accessible.

181,548
RESULTS


web 181,548

PART OF
Web Collections
Web Crawls

TOPIC
crawldata 149,504
archiveteam 111
2011 44
Incremental crawl of the Portuguese web 44
Portuguese Web Archive 44
Portuguese online publications 44
pwacrawlid:AWP12 44
wikiteam 29
wikimedia 21
wikisource 19
data dumps 18
mediawiki 18
engadget 17
wikimediadownloads 17
www.engadget.com 17
arstechnica.com 16
theregister 13
theregister.co.uk 13
WikiTeam 12
Wikimedia Commons 12
groklaw 12
groklaw.net 12
robots.txt 12
dewikisource 9
MediaWiki 8
arstechnica articles 8
arstechnica images 8
engadget articles 8
engadget images 8
groklaw articles 8
wiki 8
arwikisource 4
groklaw pdfs 4
underground-gamer 4
underground-gamer.com 4
Data 3
Wikimedia projects pageviews 3
Wikipedia 3
analysis 3
glenn beck 3
long tail 3
statistics 3
theblaze 3
theblaze.com 3
user behavior 3
warc 3
web traffic 3
webcrawl 3
ArchiveTeam 2
Tomas M 2
Wikisource 2
Wikisource forks 2
andriasang.com 2
atarimuseum 2
atarimuseum.com 2
cdx 2
firefly 2
g4tv.com 2
image tarballs 2
internet society 2
ptwikisource 2
rwc 2
slax 2
slax.org 2
sourceswiki 2
torrentfreak 2
torrentfreak.com 2
wikimedia-mediatar 2
808 1
Agenda 21 1
Archive Team 1
Archiveteam 1
CNN 1
Captioning 1
FortuneCities 1
Fortunecity 1
Lebbeus Woods 1
MetaVidWiki 1
Muff Wiggler wiki 1
News 1
News Shows 1
SMW 1
Screenshots 1
Semantic MediaWiki 1
Semantic web 1
Substainable Development 1
Tabblo 1
Tom Merritt 1
Transcriptions 1
Transcripts 1
UN 1
Vikimedio 1
Vikipedio 1
Wikia 1
Wikimedia Italia 1
WordPress 1
abit 1
amgia 1
amgiahistory.co.uk 1
apple2history 1
apple2history.org 1
archiveteam web 1
archiveteam web human rights 1
archiveteam web newspaper montreal quebec canada 1
aspergerfoundation 1
aspergerfoundation.org.uk 1
att 1
attlabs 1
automobile 1
bad-influence 1
bad-influence.co.uk 1
cave-stg 1
commodore 64 1
commodorefree 1
commodorefree.org 1
crypto.stanford.edu 1
cryptoanarchy 1
danweinreb 1
danweinreb.org 1
detroit 1
detroiturbex 1
detroiturbex.com 1
diybookscanner 1
diybookscanner.org 1
dokuwiki 1
drivers 1
dumps 1
engadget index 1
enwikisource 1
episodes 1
esperanto 1
filesharefreak 1
filesharefreak.com 1
fireflyfans 1
fireflyfans.net 1
forumplanet.forumplanet.gamespy.com 1
forums 1
forums.gamespy.com 1
forums.nesdev.com 1
gamespy 1
gaming 1
hackaday 1
hackaday.com 1
history 1
hydriz 1
ifpi 1
ifpi.org 1
ign 1
images 1
imslp 1
japan.gamespot.com 1
kat.ph/blog/ 1
kickasstorrents 1
kiwix 1
lebbeuswoods.wordpress.com 1
midnightcode 1
midnightcode.org 1
mpaa 1
mpaa.org 1
occuprint 1
occuprint.org 1
occupywallst 1
occupywallst.org 1
open meetings 1
open video 1
public meetings 1
restoration 1
riviste di videogiochi 1
riviste informatiche 1
stillflying 1
stillflying.net 1
substainabledevelopment.un.org 1
telecomix 1
tommerrit.com 1
touchatag 1
transcripts 1
trs-80 1
trs-80.org 1
tucker 1
urinal 1
utah lighthouse ministry 1
utlm 1
utlm.org 1
vgmusic.com 1
videogames 1
videogiochi 1
vintagecomputing 1
vt100 1
vt100.net 1
website 1
websites 1
whitehousedossier 1
whitehousedossier.com 1
wikidata 1
wikipedia 1
www.apdl.co.uk 1
www.cave-stg.com 1
www.corp.att.com/attlabs/ 1
www.internetsociety.org 1
www.isoc.org 1
LANGUAGE
SHOW DETAILS
Internet Archive crawldata from Webwide Crawl, captured by crawl423.us.archive.org:wide from Tue Jan 17 08:02:53 PST 2012 to Tue Jan 17 01:16:20 PST 2012.
Topic: crawldata
Source: vkontakte.ru
Internet Archive crawldata from Webwide Crawl, captured by crawl413.us.archive.org:wide from Sat Jan 21 04:01:50 PST 2012 to Fri Jan 20 21:01:34 PST 2012.
Topic: crawldata
Wikipedia Outlinks February 2012
578,038
0
0
Internet Archive crawldata from wikipedia outbound links. captured by crawl435.us.archive.org:wpo from Thu Mar 1 20:56:37 PST 2012 to Thu Mar 1 14:19:48 PST 2012.
Topic: crawldata
Live Web Proxy Crawls
535,201
0
0
Internet Archive Liveweb Capture from WayBackMachine, captured by wwwb-gen1.us.archive.org:wbm from Tue Jan 3 12:37:06 PST 2012 to Tue Jan 3 06:31:55 PST 2012.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl337.us.archive.org:wide from Wed Oct 17 08:14:47 PDT 2012 to Wed Oct 17 02:41:59 PDT 2012.
Topic: crawldata
Live Web Proxy Crawls
498,273
0
0
Internet Archive Liveweb Capture from WayBackMachine, captured by wwwb-gen1.us.archive.org:wbm from Sat May 12 21:38:20 PDT 2012 to Sat May 12 18:43:53 PDT 2012.
Topic: crawldata
recurrence=NONE, maxDuration=259200, maxDocumentCount=null, isTestCrawl=false, seedCount=507, accountId=156, organizationName="Virginia Tech: Crisis, Tragedy, and Recovery Network", collectionId=2438, collectionName="Japan Earthquake"
Internet Archive crawldata from Webwide Crawl, captured by crawl339.us.archive.org:wide from Fri Oct 19 01:17:49 PDT 2012 to Thu Oct 18 21:38:24 PDT 2012.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl413.us.archive.org:wide from Thu Jan 5 18:18:33 PST 2012 to Thu Jan 5 11:30:47 PST 2012.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl413.us.archive.org:wide from Thu Jan 5 20:28:12 PST 2012 to Thu Jan 5 13:25:29 PST 2012.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl413.us.archive.org:wide from Thu Jan 5 16:52:35 PST 2012 to Thu Jan 5 10:17:00 PST 2012.
Topic: crawldata
Live Web Proxy Crawls
306,121
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live1.us.archive.org from 2012-11-22T06:44:42 UTC to 2012-11-22T18:14:08 UTC.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl422.us.archive.org:wide from Fri Jun 8 07:28:07 PDT 2012 to Fri Jun 8 02:31:40 PDT 2012.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl336.us.archive.org:wide from Wed Jan 18 17:01:06 PST 2012 to Wed Jan 18 09:17:47 PST 2012.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl413.us.archive.org:wide from Thu Jan 5 15:53:45 PST 2012 to Thu Jan 5 08:51:54 PST 2012.
Topic: crawldata
Live Web Proxy Crawls
262,103
0
0
Internet Archive Liveweb Capture from WayBackMachine, captured by wwwb-gen1.us.archive.org:wbm from Tue Jan 3 14:35:17 PST 2012 to Tue Jan 3 08:02:37 PST 2012.
Topic: crawldata
Live Web Proxy Crawls
257,948
0
0
Internet Archive Liveweb Capture from WayBackMachine, captured by wwwb-gen1.us.archive.org:wbm from Tue Jan 3 11:14:32 PST 2012 to Tue Jan 3 04:36:49 PST 2012.
Topic: crawldata
Wikipedia Outlinks February 2012
256,418
0
0
Internet Archive crawldata from wikipedia outbound links. captured by crawl435.us.archive.org:wpo from Sat Feb 11 14:58:31 PST 2012 to Sat Feb 11 08:33:22 PST 2012.
Topic: crawldata
Live Web Proxy Crawls
256,023
0
0
Internet Archive Liveweb Capture from WayBackMachine, captured by wwwb-gen1.us.archive.org:wbm from Sun Feb 12 06:50:14 PST 2012 to Sun Feb 12 02:22:07 PST 2012.
Topic: crawldata
Live Web Proxy Crawls
241,874
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-gen1.us.archive.org from 2012-06-26T02:05:01 UTC to 2012-06-26T10:40:25 UTC.
Topic: crawldata
Live Web Proxy Crawls
229,160
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-gen1.us.archive.org from 2012-07-30T04:16:42 UTC to 2012-07-30T11:28:26 UTC.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl413.us.archive.org:wide from Thu Jan 5 19:32:22 PST 2012 to Thu Jan 5 12:20:54 PST 2012.
Topic: crawldata
Live Web Proxy Crawls
221,273
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live0.us.archive.org from 2012-12-14T21:29:12 UTC to 2013-01-23T21:42:32 UTC.
Topic: crawldata
Live Web Proxy Crawls
216,428
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live0.us.archive.org from 2012-12-18T16:45:10 UTC to 2012-12-19T09:20:10 UTC.
Topic: crawldata
Live Web Proxy Crawls
207,404
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live0.us.archive.org from 2012-11-22T04:31:46 UTC to 2012-11-22T19:27:11 UTC.
Topic: crawldata
Wide Crawl started April 2012
206,221
0
0
Internet Archive crawldata from Webwide Crawl, captured by crawl427.us.archive.org:wide from Fri May 25 03:27:07 PDT 2012 to Thu May 24 22:54:59 PDT 2012.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl420.us.archive.org:wide from Wed Jan 11 15:00:58 PST 2012 to Wed Jan 11 07:47:03 PST 2012.
Topic: crawldata
Live Web Proxy Crawls
202,728
0
0
Internet Archive Liveweb Capture from WayBackMachine, captured by wwwb-gen1.us.archive.org:wbm from Tue Apr 24 15:08:33 PDT 2012 to Tue Apr 24 11:01:55 PDT 2012.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl427.us.archive.org:wide from Tue Apr 17 00:58:46 PDT 2012 to Mon Apr 16 20:37:31 PDT 2012.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl422.us.archive.org:wide from Thu Jun 7 23:15:28 PDT 2012 to Thu Jun 7 19:17:03 PDT 2012.
Topic: crawldata
Live Web Proxy Crawls
196,197
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live0.us.archive.org from 2012-10-15T10:26:27 UTC to 2012-10-15T18:41:53 UTC.
Topic: crawldata
Live Web Proxy Crawls
193,763
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live1.us.archive.org from 2012-12-20T18:25:28 UTC to 2012-12-21T16:41:39 UTC.
Topic: crawldata
Live Web Proxy Crawls
184,204
0
0
Internet Archive Liveweb Capture from WayBackMachine, captured by wwwb-gen1.us.archive.org:wbm from Sun Jan 1 09:53:25 PST 2012 to Sun Jan 1 03:42:17 PST 2012.
Topic: crawldata
Live Web Proxy Crawls
181,386
0
0
Internet Archive Liveweb Capture from WayBackMachine, captured by wwwb-gen1.us.archive.org:wbm from Sun May 13 12:20:19 PDT 2012 to Sun May 13 10:03:22 PDT 2012.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl426.us.archive.org:wide from Wed Jan 11 14:29:43 PST 2012 to Wed Jan 11 07:51:53 PST 2012.
Topic: crawldata
Live Web Proxy Crawls
175,122
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live1.us.archive.org from 2012-12-14T17:34:12 UTC to 2012-12-15T10:27:19 UTC.
Topic: crawldata
Live Web Proxy Crawls
174,742
0
0
Internet Archive Liveweb Capture from WayBackMachine, captured by wwwb-gen1.us.archive.org:wbm from Tue Feb 28 04:16:59 PST 2012 to Tue Feb 28 03:44:00 PST 2012.
Topic: crawldata
Wide Crawl started April 2012
174,114
0
0
Internet Archive crawldata from Webwide Crawl, captured by crawl427.us.archive.org:wide from Fri May 25 11:48:37 PDT 2012 to Fri May 25 07:42:57 PDT 2012.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl421.us.archive.org:wide from Wed Oct 24 12:05:49 PDT 2012 to Wed Oct 24 07:00:10 PDT 2012.
Topic: crawldata
Live Web Proxy Crawls
171,469
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live1.us.archive.org from 2012-12-18T15:31:35 UTC to 2012-12-19T04:58:30 UTC.
Topic: crawldata
Live Web Proxy Crawls
170,753
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live0.us.archive.org from 2012-12-02T02:02:33 UTC to 2012-12-02T17:04:32 UTC.
Topic: crawldata
Live Web Proxy Crawls
169,332
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live1.us.archive.org from 2012-12-31T05:25:29 UTC to 2013-01-01T02:09:59 UTC.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl413.us.archive.org:wide from Thu Jan 5 14:54:11 PST 2012 to Thu Jan 5 07:47:32 PST 2012.
Topic: crawldata
Live Web Proxy Crawls
164,357
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live0.us.archive.org from 2012-12-14T22:44:21 UTC to 2012-12-15T05:16:24 UTC.
Topic: crawldata
Live Web Proxy Crawls
164,222
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live1.us.archive.org from 2012-10-29T08:21:22 UTC to 2012-10-29T16:24:07 UTC.
Topic: crawldata
Live Web Proxy Crawls
163,172
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live0.us.archive.org from 2012-12-30T15:07:01 UTC to 2012-12-31T14:15:26 UTC.
Topic: crawldata
Live Web Proxy Crawls
162,082
0
0
Internet Archive Liveweb Capture from WayBackMachine, captured by wwwb-gen1.us.archive.org:wbm from Mon Apr 23 17:57:46 PDT 2012 to Mon Apr 23 13:11:13 PDT 2012.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl425.us.archive.org:wide from Wed Nov 14 17:39:11 PST 2012 to Wed Nov 14 10:55:11 PST 2012.
Topic: crawldata
Live Web Proxy Crawls
159,695
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live0.us.archive.org from 2012-10-22T11:59:47 UTC to 2012-10-22T21:55:41 UTC.
Topic: crawldata
Live Web Proxy Crawls
159,208
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live1.us.archive.org from 2012-10-22T13:45:05 UTC to 2012-10-22T23:54:50 UTC.
Topic: crawldata
Live Web Proxy Crawls
158,540
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live1.us.archive.org from 2012-10-13T05:37:57 UTC to 2012-10-13T20:05:35 UTC.
Topic: crawldata
Live Web Proxy Crawls
157,974
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-gen1.us.archive.org from 2012-06-12T06:29:42 UTC to 2012-06-12T15:58:05 UTC.
Topic: crawldata
Live Web Proxy Crawls
156,112
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live0.us.archive.org from 2012-10-29T04:58:36 UTC to 2012-10-29T15:11:48 UTC.
Topic: crawldata
Live Web Proxy Crawls
155,837
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-gen1.us.archive.org from 2012-07-27T10:56:04 UTC to 2012-07-27T17:15:08 UTC.
Topic: crawldata
Live Web Proxy Crawls
155,235
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-gen1.us.archive.org from 2012-07-03T21:29:22 UTC to 2012-07-04T04:33:10 UTC.
Topic: crawldata
Live Web Proxy Crawls
155,149
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live1.us.archive.org from 2012-12-27T00:05:42 UTC to 2012-12-27T16:35:17 UTC.
Topic: crawldata
Source: google.co.jp
Live Web Proxy Crawls
152,609
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live0.us.archive.org from 2012-10-30T22:58:28 UTC to 2012-10-31T22:49:55 UTC.
Topic: crawldata
Wikipedia Outlinks February 2012
152,516
0
0
Internet Archive crawldata from wikipedia outbound links. captured by crawl435.us.archive.org:wpo from Tue Jul 17 07:48:31 PDT 2012 to Tue Jul 17 01:35:31 PDT 2012.
Topic: crawldata
ameblo.jp
152,508
0
0
Source: ameblo.jp
Live Web Proxy Crawls
151,833
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-gen1.us.archive.org from 2012-09-07T11:43:02 UTC to 2012-09-07T16:16:59 UTC.
Topic: crawldata
Live Web Proxy Crawls
151,475
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live1.us.archive.org from 2012-10-30T19:58:33 UTC to 2012-10-31T18:47:34 UTC.
Topic: crawldata
Live Web Proxy Crawls
150,138
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live1.us.archive.org from 2012-12-27T07:49:06 UTC to 2012-12-28T07:29:11 UTC.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl339.us.archive.org:wide from Wed May 2 08:00:09 PDT 2012 to Wed May 2 01:59:58 PDT 2012.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl339.us.archive.org:wide from Mon Oct 15 01:44:34 PDT 2012 to Sun Oct 14 20:06:53 PDT 2012.
Topic: crawldata
Live Web Proxy Crawls
149,695
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live1.us.archive.org from 2012-11-07T21:25:38 UTC to 2012-11-08T11:40:32 UTC.
Topic: crawldata
recurrence=NONE, maxDuration=259200, maxDocumentCount=null, isTestCrawl=false, seedCount=116648, accountId=156, organizationName="Virginia Tech: Crisis, Tragedy, and Recovery Network", collectionId=3358, collectionName="Hurricane Sandy (October 2012)"
Internet Archive crawldata from Webwide Crawl, captured by crawl339.us.archive.org:wide from Tue Oct 2 20:12:23 PDT 2012 to Tue Oct 2 17:20:31 PDT 2012.
Topic: crawldata
Live Web Proxy Crawls
146,872
0
0
Internet Archive Liveweb Capture from WayBackMachine, captured by wwwb-gen1.us.archive.org:wbm from Thu Apr 26 19:49:59 PDT 2012 to Thu Apr 26 15:19:51 PDT 2012.
Topic: crawldata
Live Web Proxy Crawls
145,514
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live0.us.archive.org from 2012-11-01T17:17:36 UTC to 2012-11-02T14:41:06 UTC.
Topic: crawldata
Live Web Proxy Crawls
144,586
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-gen1.us.archive.org from 2012-09-02T22:30:32 UTC to 2012-09-03T08:03:22 UTC.
Topic: crawldata
Live Web Proxy Crawls
144,559
0
0
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live1.us.archive.org from 2012-12-30T19:10:53 UTC to 2012-12-31T10:26:34 UTC.
Topic: crawldata